55 research outputs found

    Topological variation in single-gene phylogenetic trees

    Get PDF
    A large-scale phylogenetic study of the human lineage dramatically points up the problems of using single genes to build phylogenetic trees

    The SAR11 Group of Alpha-Proteobacteria Is Not Related to the Origin of Mitochondria

    Get PDF
    Although free living, members of the successful SAR11 group of marine alpha-proteobacteria contain a very small and A+T rich genome, two features that are typical of mitochondria and related obligate intracellular parasites such as the Rickettsiales. Previous phylogenetic analyses have suggested that Candidatus Pelagibacter ubique, the first cultured member of this group, is related to the Rickettsiales+mitochondria clade whereas others disagree with this conclusion. In order to determine the evolutionary position of the SAR11 group and its relationship to the origin of mitochondria, we have performed phylogenetic analyses on the concatenation of 24 proteins from 5 mitochondria and 71 proteobacteria. Our results support that SAR11 group is not the sistergroup of the Rickettsiales+mitochondria clade and confirm that the position of this group in the alpha-proteobacterial tree is strongly affected by tree reconstruction artefacts due to compositional bias. As a consequence, genome reduction and bias toward a high A+T content may have evolved independently in the SAR11 species, which points to a different direction in the quest for the closest relatives to mitochondria and Rickettsiales. In addition, our analyses raise doubts about the monophyly of the newly proposed Pelagibacteraceae family

    What Was the Set of Ubiquitin and Ubiquitin-Like Conjugating Enzymes in the Eukaryote Common Ancestor?

    Get PDF
    Ubiquitin (Ub)-conjugating enzymes (E2) are key enzymes in ubiquitination or Ub-like modifications of proteins. We searched for all proteins belonging to the E2 enzyme super-family in seven species (Homo sapiens, Mus musculus, Drosophila melanogaster, Caenorhabditis elegans, Schizosaccharomyces pombe, Saccharomyces cerevisiae, and Arabidopsis thaliana) to identify families and to reconstruct each family’s phylogeny. Our phylogenetic analysis of 207 genes led us to define 17 E2 families, with 37 E2 genes, in the human genome. The subdivision of E2 into four classes did not correspond to the phylogenetic tree. The sequence signature HPN (histidine–proline–asparagine), followed by a tryptophan residue at 16 (up to 29) amino acids, was highly conserved. When present, the active cysteine was found 7 to 8 amino acids from the C-terminal end of HPN. The secondary structures were characterized by a canonical alpha/beta fold. Only family 10 deviated from the common organization because the proteins were devoid of enzymatic activity. Family 7 had an insertion between beta strands 1 and 2; families 3, 5 and 14 had an insertion between the active cysteine and the conserved tryptophan. The three-dimensional data of these proteins highlight a strong structural conservation of the core domain. Our analysis shows that the primitive eukaryote ancestor possessed a diversified set of E2 enzymes, thus emphasizing the importance of the Ub pathway. This comprehensive overview of E2 enzymes emphasizes the diversity and evolution of this superfamily and helps clarify the nomenclature and true orthologies. A better understanding of the functions of these enzymes is necessary to decipher several human diseases

    Site-specific time heterogeneity of the substitution process and its impact on phylogenetic inference

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Model violations constitute the major limitation in inferring accurate phylogenies. Characterizing properties of the data that are not being correctly handled by current models is therefore of prime importance. One of the properties of protein evolution is the variation of the relative rate of substitutions across sites and over time, the latter is the phenomenon called heterotachy. Its effect on phylogenetic inference has recently obtained considerable attention, which led to the development of new models of sequence evolution. However, thus far focus has been on the quantitative heterogeneity of the evolutionary process, thereby overlooking more qualitative variations.</p> <p>Results</p> <p>We studied the importance of variation of the site-specific amino-acid substitution process over time and its possible impact on phylogenetic inference. We used the CAT model to define an infinite mixture of substitution processes characterized by equilibrium frequencies over the twenty amino acids, a useful proxy for qualitatively estimating the evolutionary process. Using two large datasets, we show that qualitative changes in site-specific substitution properties over time occurred significantly. To test whether this unaccounted qualitative variation can lead to an erroneous phylogenetic tree, we analyzed a concatenation of mitochondrial proteins in which Cnidaria and Porifera were erroneously grouped. The progressive removal of the sites with the most heterogeneous CAT profiles across clades led to the recovery of the monophyly of Eumetazoa (Cnidaria+Bilateria), suggesting that this heterogeneity can negatively influence phylogenetic inference.</p> <p>Conclusion</p> <p>The time-heterogeneity of the amino-acid replacement process is therefore an important evolutionary aspect that should be incorporated in future models of sequence change.</p

    Phylogenomic evidence for a common ancestor of mitochondria and the SAR11 clade

    Get PDF
    Mitochondria share a common ancestor with the Alphaproteobacteria, but determining their precise origins is challenging due to inherent difficulties in phylogenetically reconstructing ancient evolutionary events. Nonetheless, phylogenetic accuracy improves with more refined tools and expanded taxon sampling. We investigated mitochondrial origins with the benefit of new, deeply branching genome sequences from the ancient and prolific SAR11 clade of Alphaproteobacteria and publicly available alphaproteobacterial and mitochondrial genome sequences. Using the automated phylogenomic pipeline Hal, we systematically studied the effect of taxon sampling and missing data to accommodate small mitochondrial genomes. The evidence supports a common origin of mitochondria and SAR11 as a sister group to the Rickettsiales. The simplest explanation of these data is that mitochondria evolved from a planktonic marine alphaproteobacterial lineage that participated in multiple inter-specific cell colonization events, in some cases yielding parasitic relationships, but in at least one case producing a symbiosis that characterizes modern eukaryotic life

    Addressing Inter-Gene Heterogeneity in Maximum Likelihood Phylogenomic Analysis: Yeasts Revisited

    Get PDF
    Phylogenomic approaches to the resolution of inter-species relationships have become well established in recent years. Often these involve concatenation of many orthologous genes found in the respective genomes followed by analysis using standard phylogenetic models. Genome-scale data promise increased resolution by minimising sampling error, yet are associated with well-known but often inappropriately addressed caveats arising through data heterogeneity and model violation. These can lead to the reconstruction of highly-supported but incorrect topologies. With the aim of obtaining a species tree for 18 species within the ascomycetous yeasts, we have investigated the use of appropriate evolutionary models to address inter-gene heterogeneities and the scalability and validity of supermatrix analysis as the phylogenetic problem becomes more difficult and the number of genes analysed approaches truly phylogenomic dimensions. We have extended a widely-known early phylogenomic study of yeasts by adding additional species to increase diversity and augmenting the number of genes under analysis. We have investigated sophisticated maximum likelihood analyses, considering not only a concatenated version of the data but also partitioned models where each gene constitutes a partition and parameters are free to vary between the different partitions (thereby accounting for variation in the evolutionary processes at different loci). We find considerable increases in likelihood using these complex models, arguing for the need for appropriate models when analyzing phylogenomic data. Using these methods, we were able to reconstruct a well-supported tree for 18 ascomycetous yeasts spanning about 250 million years of evolution

    What are the consequences of combining nuclear and mitochondrial data for phylogenetic analysis? Lessons from Plethodon salamanders and 13 other vertebrate clades

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>The use of mitochondrial DNA data in phylogenetics is controversial, yet studies that combine mitochondrial and nuclear DNA data (mtDNA and nucDNA) to estimate phylogeny are common, especially in vertebrates. Surprisingly, the consequences of combining these data types are largely unexplored, and many fundamental questions remain unaddressed in the literature. For example, how much do trees from mtDNA and nucDNA differ? How are topological conflicts between these data types typically resolved in the combined-data tree? What determines whether a node will be resolved in favor of mtDNA or nucDNA, and are there any generalities that can be made regarding resolution of mtDNA-nucDNA conflicts in combined-data trees? Here, we address these and related questions using new and published nucDNA and mtDNA data for <it>Plethodon </it>salamanders and published data from 13 other vertebrate clades (including fish, frogs, lizards, birds, turtles, and mammals).</p> <p>Results</p> <p>We find widespread discordance between trees from mtDNA and nucDNA (30-70% of nodes disagree per clade), but this discordance is typically not strongly supported. Despite often having larger numbers of variable characters, mtDNA data do not typically dominate combined-data analyses, and combined-data trees often share more nodes with trees from nucDNA alone. There is no relationship between the proportion of nodes shared between combined-data and mtDNA trees and relative numbers of variable characters or levels of homoplasy in the mtDNA and nucDNA data sets. Congruence between trees from mtDNA and nucDNA is higher on branches that are longer and deeper in the combined-data tree, but whether a conflicting node will be resolved in favor mtDNA or nucDNA is unrelated to branch length. Conflicts that are resolved in favor of nucDNA tend to occur at deeper nodes in the combined-data tree. In contrast to these overall trends, we find that <it>Plethodon </it>have an unusually large number of strongly supported conflicts between data types, which are generally resolved in favor of mtDNA in the combined-data tree (despite the large number of nuclear loci sampled).</p> <p>Conclusions</p> <p>Overall, our results from 14 vertebrate clades show that combined-data analyses are not necessarily dominated by the more variable mtDNA data sets. However, given cases like <it>Plethodon</it>, there is also the need for routine checking of incongruence between mtDNA and nucDNA data and its impacts on combined-data analyses.</p

    After the Fukushima Daiichi Accident, Extending the Human and Organizational Factors (HOF) Framework to Safety Regulation

    No full text
    International audienceThe accident of Fukushima-Daichi is regarded as a product of multiple failures of thenuclear risks regulation system in Japan and more particularly as a failure of the regulatorysystem (authorities, regulator and operator) to take into account seismic risks and floodrisks caused by tsunamis. This statement conducted the French institute for radiologicalprotection and nuclear safety (IRSN) to develop a research program dedicated to the studyof the way the French nuclear regulatory system developed and addresses flood risk
    corecore